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communication involves both auditory and visual modalities, providing robustness and naturalness 

shape and color. In addition, if movement of an object is detected, this information is used 

a known face in a restricted area around the last location. During tracking the system learns features of 

werner.ira.uka.de/papers/multimodal/94.acssc.ps.gz 

Incremental Focus of Attention for Robust Visual Tracking - Tovama. Hager (1996) (Correct) (11 citations) 
Incremental Focus of Attention for Robust Visual Tracking Kentaro Toyama and Gregory D. Hager 
further attention. For example, if the target object is a falling apple, one layer of the framework 
to return approximate information on feature location or configuration. 1 Introduction Robustness 
www.cs.yale.edu/HTML/YALE/CS/HyPians/toyama/layered.ps.gz 

Single Lens Stereo with a Plenoptic Camera - Adelson, Wang (1992) (Correct) (16 citations) 
of a cone of light that Leonardo called a "visual pyramid.The space surrounding an object is 
optical structure, one can infer the depths of objects in the scene, i.e.one can achieve "single lens 
that would be seen by a pinhole camera at a given location, a) b) c) d) Fig. 2. a) Pinhole camera 
www-bcs.mit.edu/people/adelson/./publications/postscript/plenoptic.ps,Z 

Dynamic registration Corrections in Augmented-Reality Systems - Baiura, Neumann (1995) (Correct) 
(9 citations) 

This paper addresses the problem of correcting visual registration errors in video-based 
registration between real and computergenerated objects in combined images is critically important for 
ing system and specifies the location of a tracking element's position on the user's 
usc.edu/pub/graphics/papers/vrais. ps.Z 

Virtual Notepad: Handwriting in Immersive VR - Poupvrev, Tomokazu, Weghorst (1998) (Correct) 
(4 citations) 

This virtual pen provides the user with a constant visual reference for the location of the entry point. As 
the user to add audio annotations to virtual objects. Annotations are represented as a small marker 
the user with a constant visual reference for the location of the entry point. As the user draws on the 
www.hitl.washington.edu/publications/r-97-46/r-97-46.ps 

A System for Automated Site Model Acguisition - Collins, Javnes, Stolle.. (1995) (Correct) (7 citations) 
the model to be overlaid on the image to aid visual change detection and verification of expected 
and manipulating images, camera models, object models and terrain models, and for keeping track 
and X-Y coordinates represent their horizontal location in the site. 2.3 Camera models For each image 
vis-ftp.cs.umass.edu/Papers/collins/spie95.ps.gz 

Face Tracking and Pose Representation - McKenna, Collins (1996) (Correct) (7 citations) 
depth. Principal components analysis was used to visualise the manifolds described by pose changes, 
system based on an integrated motion-based object tracking and model-based face detection 
produces a zero-crossing in S(x y t) at the location of the edge in the middle frame of the "history" 
www.dcs.qmw.ac.uk/research/vision/articles/bmvc2.ps.gz 

Nonparametric Recognition of Nonrigid Motion - Polana, Nelson (1995) (Correct) (7 citations) 
sequences are described. 1 Introduction Visual motion has long been considered a vital source of 
the sources of different motions, identifying objects moving relative to the surrounding environment, 
ftp.cs.rochester.edu/pub/papers/robotics/95.tr575.Nonparametric_recognition_of_nonrigid_motion.ps.gz 

Real-Time Hand Tracking and Gesture Recognition Using Smart Snakes - Heap (1995) (Correct) 
(6 citations) 

hand and recognise any gestures made, using only visual input, is taken entirely for granted by humans, 
model that can be used to track any 2D deformable object. 1 Introduction Our hands play a very important 
can be projected onto an image by specifying its location, in terms of scale s, rotation ^-translation 
ftp.orl.co.uk/pub/docs/ORL/tr.95. 1. ps.Z 

Tracking Human Motion Using Multiple Cameras - Cai, Aggarwal (1996) (Correct) (6 citations) 

(urn v m )uN vN )T As for the visual features, we use an N dimensional feature vector 

a fixed camera [1 , 2] to tracking non-background objects in a single moving camera [3]The studies in 
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frames taken by cameras mounted in various locations. Experimental results from real data show the 
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Computer-Aided Image-Guided Bone Fracture Surgery: Modeling. .. - Tockus Joskowicz (1998) (Correct) 
(3 citations) 

Image-Guided Bone Fracture Surgery: Modeling, Visualization, and Preoperative Planning L. Tockus 
is used to provide accurate, real-time spatial object positions with optical cameras following infrared 
the surgeon must mentally reconstruct the location of the parts in space and time, manipulate the 
www.cs.huji.ac.il/-josko/system98.ps 

Learning, Positioning, and Tracking Visual Appearance - Shree Navar (1994) (Correct) (6 citations) 
Learning, Positioning, and Tracking Visual Appearance Shree K. Nayar, Hiroshi Murase, and 
determining the mapping between robot position and object appearance. The robot is first moved through 
either automatically position itself at a desired location with respect to an object, or accurately follow 
ftp.cs.columbia,edu/pub/CAVE/papers/nayar/nayar-murase-nene-track_icra-94.ps.gz 

Understanding People Pointing: The Perseus System - Kahn, Swain (1995) (Correct) (5 citations) 
In this paper we present Perseus, a purposive visual system used by our robot, CHIP, to locate objects 
visual system used by our robot, CHIP, to locate objects being pointed at by people. Perseus uses 
Providing a verbal description of the trash's location to CHIP is awkward it is far more natural to 
cs-www.uchicago.edu/-kahn/Papers/postscript/iscv95.ps 

Finger Tracking as an Input Device for Augmented Reality - James Crowley (1995) (Correct) (5 citations) 
Abstract This paper concerns techniques for visual tracking of pointing devices. The first section 
evolution. The barrier between physical objects (paper, pencils, calculators) and their 
provides a method in which the most probable location of the pointing device is determined by 
pandora.imag.fr/Prima/jlc/FG95.ps.gz 

Tracking Faces - McKenna, Gong (1996) (Correct) (5 citations) 

approaches depend on a robust method for grouping visual motions consistently over time [10]They tend to 
and a temporally consistent list of moving objects was maintained. Objects were tracked using 
a temporal zero-crossing in S(x, y, t) at the location of the edge in the middle frame of the "history" 
www.dcs, qmw.ac.uk/research/vision/articles/iwafgr-tracking.ps.gz 

An Intelligent Observer - Becker Gonz'alez-Ba (1995) (Correct) (5 citations) 

performs its tasks, the system provides real-time visual feedback to the user. We have implemented a 

one or more cameras which allow it to track objects while at the same time sensing its own location. 

objects while at the same time sensing its own location. It interacts with a human user who issues 

robotics.stanford.edu/-hhg/doc/iser95/iser95.ps.gz 

Highlight and Reflection-Independent Multiresolution.. - Ofek, Shilat.. (1997) (Correct) (3 citations) 
that texture maps are essential for adding to the visual content of the rendered image. Extraction of 
distortions 2) it can extract textures from objects with any known 3-D geometric structure 
when the texture is fixed in an inconvenient location (e.g. on the outside)when illumination 
www.cs.huji.ac.il/papers/IP/multiresoiution-texture.ps.gz 

Providing a Low Latency User Experience in a High Latency.. - Conner, Holden (1997) (Correct) (3 citations) 
Center for Computer Graphics and Scientific Visualization Providence, Rl 02912 lsh@cs.brown.edu 
motion or derivative information provided by an object itself, the dead reckoning system calculates the 
a continuous motion, not a discrete change of location. Other work has developed techniques for 
www.es. brown.edu/research/graphics/research/pub/papers/i3d97-blurghost/i3d96, ps.gz 

Recognizing Hand Gestures - Davis, Shah (1994) (Correct) (5 citations) 
(SFM) method in which the 3-D visual interpretation of hand gestures is used in a 
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as a means of communication, e.g. pointing to an object to bring someone's attention to the object, 
Using stereo images, their system uses the 3-D location of fingers rather than the 2-D location. The 

vismod.www.media.mit.edu/^jdavis/OldPapers/eccv.ps.Z 

A Continuous Media Transport and Orchestration Service - Andrew Campbell (1992) (Correct) (7 citations) 

have been implemented including an audio/visual telephone and a video disc jockey console. 3. 

and transport services are integrated into an object-based distributed multimedia application 

type (ADT) interfaces which are accessed ina# location independent fashion. Invocation is implemented* 

www.cs.uit.no/-weihai/Mlv1sem.v97/pensum/Campbell.ps 

Coordination of perceptual processes for Computer.. - Coutaz, Eterard, Crowley (1996) (Correct) (4 citations) 
face tracking, data fusion, integration of visual processes, media space. 1. Introduction Computer 
a result, peripheral awareness of distant people, objects, and events is lost. In addition, the static 
(e.g.Vphone and exploration of a distant location such as a public area using a virtual window) 
iihm.imag.fr/pubis/1996/FG96_Comedi.ps.gz 

Tracking Objects By Color Alone - Rasmussen, Tovama, Hager (1996) (Correct) (4 citations) 
devices. 1 Introduction Tracking is a common visual task with many uses. By maintaining focus on 
Tracking Objects By Color Alone Christopher Rasmussen, Kentaro 

with minimal specularity or choosing color sample locations on them away from specularities, we have found 
www.cs.yale.edu/HTML/YALE/CS/HyPlans/rasmussen/lib/papers/rr1 1 1 4.ps.gz 

The Visual Display Transformation for Virtual Reality - Robinett, Holloway (1995) (Correct) (4 citations) 
Ent S I G I L L Urn Lux Libertas The Visual Display Transformation For Virtual Reality Warren 
series of transformations used to map points from object coordinates to screen coordinates. Virtual 
the HMD is called the user, and also has a location and orientation within the virtual world. A good 
cs . ru .ac .za/homes/g97rc001 /papers/94-03 1 . ps .gz 

Security of Web Browser Scripting Languages: Vulnerabilities.. - Anupam, Mayer (1998) (Correct) 
(2 citations) 

scripting language that looks a lot like Visual Basic. It is loosely typed and object based. It 
to refer to both strains.JavaScript is object-based in the sense that it uses built-in and user 
document when it is loaded by the browser. The location object represents the URL of the current 

www.beli-labs.com/user/alain/papers/usenix98.ps.gz 

W4: Who? When? Where? What? A Real Time System for.. - Haritaoglu, Harwood.. (1998) (Correct) 
(2 citations) 

Park, MD 20742 Abstract W 4 is a real time visual surveillance system for detecting and tracking 
determine types of interactions between people and objects, and to overcome the inevitable errors and 
occupancy overlap tests between the predicted locations of objects and the locations of detected 
www.umiacs.umd.edu/users/lsd/vsam/fg98.ps.gz 

Using the CONDENSATION Algorithm for Robust.. - Dellaert, Burgard.. (1999) (Correct) (1 citation) 
to globally localize the camera platform, given a visual map of the environment. Based on these two 
of the camera platform rather than tracking an object in the scene. In addition, it can also be used to 
as well as tracking the robot's position once its location is known. Vision has long been advertised as 
www.ri.cmu.edu/pub_files/pub1/dellaert_frank_1999_1/dellaert_frank_1999__1.ps.gz 

Starfield Information Visualization with Interactive Smooth.. - Ninad Jog (1995) (Correct) (3 citations) 

1 Starfield Information Visualization with Interactive Smooth Zooming Ninad 

through a space -say a world of 3-D graphical objects as in virtual reality applications, or in an 

- thereby making the starfield display a map of locations. Other databases exploit the starfield display 

ftp.cs.umd.edu/pub/papers/papers/ncstrl. umcp/CS-TR-3286/CS-TR-3286.ps.Z 

A Self-organizing Neural Network Architecture for.. - Cameron, Grossberg.. (1995) (Correct) (3 citations) 

1995 3 1 . Introduction: Optic Flow, Heading, and Visual Navigation As we move through the world, we 

of heading, scene depth, and moving object locations. These representations are used to 

of heading, scene depth, and moving object locations. These representations are used to reactively 

cns-web.bu.edu/-guenther/cameron_grossberg_guenther__article.ps.Z 
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A Modular Visual Tracking System - Wessler (1995) (Correct) (3 citations) 
A Modular Visual Tracking System by Mike Wessler A.B. Harvard 
: 20 2.1 .3 Object recognition (What" 
www.ai.mit.edu/people/wessler/main.ps.Z 
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Reflection of Presence: Toward more natural and.. - Agamanolis, Westner, .. (1997) (Correct) (2 citations) 
looking at each other through a real mirror. Using visual and auditory cues, segmented images of 
and collaboratively manipulate and annotate media objects in the background. The system is novel in that 
even though they are all in separate remote locations. However, instead of seeing a reflection of 
dsmall.www.media. mit.edu/-vmb/papers/spie97stefan. ps.Z 

Velocity and Disparity Cues for Robust Real-Time Binocular.. - Rougeaux, Kuniyoshi (1997) (Correct) 
(2 citations) 

found in the primary functions of biological visual systems to robustly track moving targets in 
algorithm quickly locates independently moving objects for target acquisition and provides a reliable 
to adapt the horopter geometry to the target location. The system takes advantage of the optical 
www.etl.go.jp:8080/etl/robotics/Projects/Humanoid/postscript/cvpr_97.ps.gz 

Controlled Active Exploration of Uncalibrated Environments - Smith, Brandt.. (1994) (Correct) (3 citations) 

Of the sensors available to a robotic agent, visual sensors provide information that is richer and 

with little or no a priori knowledge of the object- and camerarelated parameters to robustly 

of the controller indicates as the possible location of the minimum. In the general case, search time 

www.es. colorado.edu/-sbrandt/papers/CVPR94. ps.Z 

Holographic stereograms as discrete imaging systems - Halle (1994) (Correct) (3 citations) 

the huge amount of information needed to produce a visually acceptable three-dimensional image. Because of 

Ma 02139 Usa Abstract Unlike Holograms Of Real Objects, Holographic Stereograms Consist Of Information 

used by humans to determine the three-dimensional location of an object. If a single image of the point is 

ftp.media.mit.edu/pub/halazar/discrete-preprint.ps.Z 

Mediated reality - Mann (1994) (Correct) (3 citations) 

desire to take away, alter, or more generally to visually * mediate' real objects, using a body-worn 
intent of Augmented Reality (AR) is to add virtual objects to the real world. A typical AR apparatus 
www.wearcam.org/mediatedreality/TR-260.ps.gz 

A Parallel Approach to Tracking Edge Segments in Dynamic Scenes - Mirmehdi, Ellis (1991) (Correct) 
(4 citations) 

(AGV)The aim of this project is to provide visual input to the vehicle control system to aid 

are in the tracking and identification of discrete objects moving through the scene, and in estimating the 

visual input to the vehicle control system to aid location and navigation. The current system is targetted 

www.cs.bris.ac.uk/Tools/Reports/Ps/mirmehdi-ivc93.ps.gz 

Video-Rate Z Keying: A New Method for Merging Images - Kanade, Oda, Yoshida.. (1995) (Correct) 
(2 citations) 

merging real and synthetic images in real time. In visual media communication and display, it is often 
Chroma keying, however, simply puts real world objects in the foreground of the synthetic image, and 
by using a computer vision technique to track the location of a human object [4]These systems, however, 
www.cs.cmu.edu/afs/cs/project/stereo-machine/www/95-38.ps.gz 

Inductive Learning of Feature-Tracking Rules for Scientific. - Arunava Baneriee (1995) (Correct) (2 citations) 
Learning of Feature-Tracking Rules for Scientific Visualization Arunava Banerjee Haym Hirsh Thomas Ellman 
in a more efficient system that can match up objects across large time steps without inspecting 
if each of O 1 O n are close to the location of O as well as to one another and the sum of 

www.es. rutgers.edu/-arunava/papers/ijcai95,ps.Z 

Keeping Your Eye on the Ball: Tracking Occluding Contours of.. - Tovama, Hager (1995) (Correct) 
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P.O. Box 208285 New Haven, CT 06520 Abstract Visual tracking is prone to distractions, where 

Ball: Tracking Occluding Contours of Unfamiliar Objects without Distraction Kentaro Toyama and Gregory 

dynamic models can be used to predict feature location [2, 3, 9]ln constraintbased tracking, no 

ftp.cs.yale.eduA/VWW/HTMLA"ALE/CS/HyPlans/toyama/papers/iros95short.ps.gz 

Characterization of the Spatial Frequency Spectrum in Perception - Ko Sakai (1995) (Correct) (2 citations) 

frequency spectrum [3]We propose that the visual system uses a strategy of characterizing the 

the most important visual cues to the shape of an object is the orderly change in texture that occurs as a 

by determining the peak frequency at each spatial location, and then averaging these frequency values over 

www.neuroengineering.upenn.edu/papers/ko/sakai_finkel.josa_1995.ps.gz 

From Gaze to Focus of Attention - Stiefelhagen. Finke, Yang, Waibel (1998) (Correct) (1 citation) 
not only use verbal means, but also a variety of visual cues for communication. For example, people use 
focus of attention. The knowledge of a person's object of interest helps us effectively communicate with 
persons head movements as well as the relative locations of probable targets of interest in a room. Over 
www.is.cs.cmu.edu/papers/multimodal/PUI98/PUI98-rainer.ps.gz 

Joint Probabilistic Techniques for Tracking Multi-Part Objects - Christopher Rasmussen (1998) (Correct) 
(1 citation) 

objects such as people and cars comprise many visual parts and attributes, yet image-based tracking 
Probabilistic Techniques for Tracking Multi-Part Objects Christopher Rasmussen Gregory D. Hager Center 
of the head can be derived from the shirt's image location and scale. If the person walks behind a piece of 
ftp.cs.yale.edu/WWW/HTML/YALE/CS/HyPlans/rasmussen/lib/papers/cvpr98.ps.gz 

Real-Time Visual Tracking Using Correlation Techniques - Eklund, Ravichandran.. (1994) (Correct) 
(2 citations) 

Real-Time Visual Tracking Using Correlation Techniques Mark W. 

and does not rely on a previous model of the object the training image for filter synthesis is 

sneezy.sri.com/-ravi/Papers/wacv-TRACK.ps 

An Integrated Traffic and Pedestrian Model-Based.. - Remagnino, Baumberg.. (1997) (Correct) (1 citation) 
integrated vision system in which two autonomous visual modules are combined to interpret a dynamic 
employs a 3D model-based scheme to track rigid objects such as vehicles. The second module uses a 2D 
its orientation and then its ground plane (GP) location .In the following, we outline our solutions to 
www.cvg.cs.reading.ac.uk/papers/ps/CVG9702.ps.gz 

Video Motion Capture - Bregler, Malik (1997) (Correct) (1 citation) 

of the person whose motion is to be captured. For visual tracking we introduce the use of a novel 

is very challenging, compared to tracking other objects such as footballs, robots or cars. These 

T describes the pixel displacement dependent on location (x y) and model parameters OE. For example, a 

http.cs. berkeley.edu/-bregler/bregler_malik_muy.ps.gz 

GARGOYLE: Context-sensitive active vision for mobile robots - Prokopowicz, Firbv, Kahn, .. (1996) (Correct) 
(1 citation) 

and performance: on-the-AEy conguration of visual routines that exploit up-to-the-second context 
For example, the visual routine for nding objects by shape uses a color model of the object to 
restriction of viewpoint search based on image location may ameliorate the inherent combinatorics of 
www.cs.uchicago.edu/-kahn/Papers/postscript/icpr96.ps 

A Visually Oriented Representation of Planar Relative Position - Jean-Marc Odobez (1996) (Correct) 
(1 citation) 

A visually oriented representation of planar relative 

the position of a camera with respect to an object (more precisely, two points of an object)lt is 
from a known position in the world, the current location of the robot is computed using odometry. 
ftp.cis.upenn.edu/pub/grasp/technical-reports/401.ps.gz 

Plan Representations for Picking Up Trash - Firbv, Prokopwicz, Swain (1995) (Correct) (1 citation) 
trash cleanup task breaks down naturally as: ffl Visual skills for nding and identifying trash and for 
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are used to pick up trash: find-small-floor-object uses fuzzy classication to identify segmented 
the object. Both nding skills set the target-location with an (x y) coordinate for the desired target 
www.es. uchicago.edu/pub/users/peterp/ictai95. ps.Z 

Model-based vehicle detection and classification.. - Sullivan, Baker.. (1996) (Correct) (1 citation) 

to simplify our previous model-based methods for visual tracking of vehicles for use in a real-time 

at highly predictable traffic flow. A major objective is to develop automatic means to detect, 

1,000 points and needs to be evaluated at all locations covering some image region. The approach is 

www.cvg.cs.reading.ac.uk/papers/ps/BMVC96_crc.ps.gz 

Detecting, localizing and grouping repeated scene elements.. - Leung. Malik (1996) (Correct) (1 citation) 
in an iconic matching scheme to provide another visual cue for recognition. 2 Relevant previous work 
this representation good for? The two short term objectives are: 1 . Grouping of these repeated elements 
map that best transforms the image patch at one location to the other. The approach we propose consists 
www.es. berkeley.eduHeungt/Research/ECCV96__final.ps.gz 
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